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Abstract 

In this paper, we consider rendering color videos using a 
non-photo-realistic art form technique commonly called stip- 
pling. Stippling is the art of rendering images using point 
sets, possibly with various attributes like sizes, elementary 
shapes, and colors. Producing nice stippling is attractive 
not only for the sake of image depiction but also because 
it yields a compact vectorial format for storing the semantic 
information of media. Moreover, stippling is by construction 
easily tunable to various device resolutions without suffering 
from bitmap sampling artifacts when resizing. The under- 
lying core technique for stippling images is to compute a 
centroidal Voronoi tessellation on a well-designed underly- 
ing density. This density relates to the image content, and 
is used to compute a weighted Voronoi diagram. By con- 
sidering videos as image sequences and initializing properly 
the stippling of one image by the result of its predecessor, 
one avoids undesirable point flickering artifacts and can pro- 
duce stippled videos that nevertheless still exhibit notice- 
able artifacts. To overcome this, our method improves over 
the naive scheme by considering dynamic point creation and 
deletion according to the current scene semantic complexity, 
and show how to effectively vectorize video while adjusting 
for both color and contrast characteristics. Furthermore, we 
explain how to produce high quality stippled "videos" (eg., 
fully dynamic spatio-temporal point sets) for media contain- 
ing various fading effects, like quick motions of objects or 
progressive shot changes. We report on practical perfor- 
mances of our implementation, and present several stippled 
video results rendered on-the-fly using our viewer that al- 
lows both spatio-temporal dynamic rescaling (eg., upscale 
vectorially frame rate). 

Keywords: Non-Photo-realistic rendering, Voronoi tes- 
sellations, Video, Stippling, Vectorization 

1 Introduction 

Historically, stippling was primarily used in the printing in- 
dustry to create dot patterns with various shade and size 
effects. This technique has proven successful to convey vi- 
sual information, and was widely adopted by artists that 
called this rendering art pointillism\^ Informally speaking, 
the main idea is that many dots carefully drawn on a pa- 
per can fairly approximate different tones perceived by local 
differences of density, as exemplified in Figure [1] To the Hu- 
man eyes, a high density region looks darker than a low den- 
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Figure 1: Stippling a photo by rendering the bitmap (left) 
as a point set (right) distributed uniformly according to a 
prescribed underlying image density: the grey-level intensity 
(here, 600 points). 



sity one. The main difference with dithering and half-toning 
methods is that points are allowed to be placed anywhere, 
and not only on a fixed regular grid. Thus the primary diffi- 
culty in stippling is to obtain a point distribution that adapts 
to a given density function, in the sense that the number of 
points in an area must be proportional to the underlying 
density. 

In this work, we present a method to generate stippled 
videos. Generating videos with stipples or other marks is 
very interesting but has not yet be fully solved. Our method 
produces high quality videos without point flickering arti- 
facts. Each point can easily be tracked during an entire shot 
sequence. Our algorithm is able to render fading effect^ 
with the adaptation of both point and color densities. Our 
method allows us to deal robustly with an entire video with- 
out having to cut it down into pieces using shot detection 
algorithms. To be able to work on the "output video" (time- 
varying point sets) and adjust dynamically some rendering 
parameters, we also developed an in-house video viewer that 
handles these vectorized stippled video. The viewer let us 
choose the contrast to adjust point section, let us use color 
codings or introduce patterns for drawing edges to improve 
the rendering quality. Moreover, the viewer application al- 
lows one to interpolate unknown frames at any time posi- 
tion to match the frame rate of the display device. Thus, we 
can easily increase the frame rate if required, or conversely 
slow down some portions of the stippled videos. To improve 
the edge render the viewer application offers one to replace 
points by patterns oriented with the local gradient of the 
current frame. 



Secord [Secord 2002 designed a non-interactive technique 
to create quality stippling still images in 2002. Based on 
the celebrated Lloyd's k- means method |Lloyd 1982| , Sec- 
ord creates stippling adapted to a density function and im- 



1 See http://www.randyglassstudio.com/ for some renderings 
by award-winning artist Randy Glass. 



2 Fading effects are so commonly met in practice in videos fol- 
lowing a storytelling that they cannot be neglected. 



proves the result by considering various point sizes. We con- 
sidered this approach of Stippling for our algorithm because 
it fits with the goal of our algorithm: being able to render an 
high quality output for every type of video without knowing 
any other data. There are other approaches like the use of 
multi-agent systems to distributes stipples has been explored 
by [Schlechtweg et al. 2005] . This way of rendering is easily 
extendable to various kind of stroke-based render. There is 
too a possibility of generating stipple patterns on 3D-objects 
by triangulating the surface and by positioning the dots on 
the triangle vertices Pastor ct al. 2003 . This is a very fast 
method but the resulting patterns are not optimal. This has 
been improved by [Vanderhaeghe et al. 2007 but we always 
cannot use as input every type of data — such as an AVI 
file taken on the internet. 

Like all computer-generated point distributions, we are 
also looking for stippled images that has the so-called blue 
noise characteristics with large mutual inter-distances be- 
tween points and no apparent regularity artifacts. An 
interesting approach of stippling with blue noise char- 
acteristics based on Penrose tiling is described by Os- 
tromoukhov Ostromoukhov ct al. 2004 . More recently 
a method based on Wang tiles enabled the creation of 
real-time stippling with blue noise characteristics. This 
method was described by |Kopf et al. 2006] . Balzer et 
al. [Balzer et al . 2009 developed another alternative way to 
create, this time Voronoi diagrams with blue noise charac- 
teristics. They introduced so-called capacity constrained 
diagrams that converge towards distributions that exhibit 
no regularity artifacts and which adaptation to given het- 
erogeneous density function behaves better. This capacity- 
constrained method enhances the spectral properties of point 
distribution while avoiding its drawbacks. Our algorithm 
can be used based on both Secord of Balzer et al. approaches 
and inherit of their blue noise characteristics. We choose the 
Secord algorithm for rapidity, or the Balzer et al. algorithm 
for high quality point distribution. 

Another way to render a stipple drawings is to mix different 
type of points instead of only use contrasted and coloured 
points. The work of S. Hiller Hiller et al. 2003 explored 
this possibility by positioning small patterns in a visually 
pleasing form in the stipple image. We use in our algorithm 
patterns to increase the render of the edges. 

We adapted these recent set of methods to video and solved 
along problems that appeared while doing so. A major 
inconvenient of these previous stippling methods was that 
while those methods can handle nicely and efficiently fine 
details and textures, they fall short when dealing with ob- 
jects with sharp edges. In this paper, we overcome this 
drawback and propose a frequency-based approach in or- 
der to detect images edges and enhanced their support and 
rendering. This frequency approach is improved by the use 
of patterns that replace some points and partially recon- 
struct the edges of the image. It is another way to use the 
frequency approach to improve the rendering of edges and 
this without increasing their size, but rather by suggesting 
their local orientation according to the surround shape ele- 
ments [Gomes et al. 2007j . 

The roadmap of the paper is as follows: The basic ingredient 
of our approach is the use of centroidal Voronoi diagrams (a 
fundamental structure of computational geometry) to pro- 
duce good-looking distributions of points, as explained in 



Section [2] This generative method will be further extended 
to compute a full video in Section 13.11 Then to improve 
the rendering, one need to change some parameters to ob- 
tain good fading effects, as explained in Section 13.21 An 
improved rendering must also consider both color informa- 
tion and point contrasts. Small adjustments described in 
Section [3. 31 are required to get such a desirable output. We 
implemented a tailored scheme to handle the stippling ren- 
dering of sharp edges. Section 13.41 explains our solution to 
improve edge rendering by considering both high and low fre- 
quencies to place points accordingly. Finally we used some 
oriented patterns to reconstruct sharp edges in the output. 
Section [3.51 explains the method used and the describes the 
obtained results. 

2 Voronoi diagrams 

In order to create a stipple image, we adapted Lloyd's k- 
means method Okabc ct al. 2000 to obtain a Voronoi di- 
agram that adapts to an underlying image density. To do 
this, we need two sets of points: 

The first point set which is called the support set of points 
is chosen following the grey-scale intensity density of 
the image. In fact, we choose in practice N points in 
the source image, where N can be larger than the total 
number of pixels in the image. Those support points 
should be drawn such that if a pixel is darker compared 
to another one, that pixel should have more chance to 
contain one or more support points. To get those sup- 
port points, we choose one point coordinate (x, y) at 
random, and then an integer falling in the grey inten- 
sity range [0,255]. If this integer is greater than 255 
minus the color of the pixel at (x, y), we decide to put 
a support point in (x,y). 

Then we generate the second set of points which will create 
the Voronoi diagram itself. We call these points: Sites. 
This second set of points has the same characteristics as 
the support points. Only the number of points changes; 
We choose only N/a points — were a — 10 3 for exam- 
ple. 

Then every generator point is identified with the closest site 
according to a given metric. The common choice is to use 
the Euclidean L 2 distance metric 

d(P 1 ,P 2 ) = ||Pl -P 2 || = V(zi-*2) 2 + (Vl-V2) 2 

where Pj = (xi,yi) are two given points. Each set of points 
identified with a particular site forms a Voronoi region, and 
the set of Voronoi regions "covers" the entire image. We can 
define each Voronoi region Vi as follows: 



Vi = {xen\ \\x- Xi\\ < \\x - xj\\ 

for j — 1, n and j ^ i j 

where ||.|| denotes the Euclidean distance derived from the 
L 2 norm. 

The following update step is performed to obtain a good- 
looking random distribution of sites: We move each site to 



the mass center of its associated generator points. Given 
a density function p(x) > defined on il, for any region 
V C f2, the standard mass center C of V is given by: 

/ xp(x)dx 



/ p(x)dx 
Jv 

Here, we consider p(x) = 1. Then we do again the identifi- 
cation for each generator point. Iterating these two steps let 
us converge to a Central Voronoi diagram |Du et al. 2 009 . 
a quasi-uniform partition of the domain adapted to a given 
image density. The CVT algorithm operations are summa- 
rized in Algorithm 1. 



Algorithm 1 Centroidal Voronoi Tesselation (CVT). 

1: Generation of support points: 

2: for i = 1 to Number of Sites * a do 

3: x — Random Integer £ [0; Width] 

4: y = Random Integer € [0; Height] 

5: m = Random Integer G [0; 255] 

6: if m > 255— Color(x, y) then 

7: Add support point at (x,y) 

8: i i + 1 

9: end if 
10: end for 
11: Generation of sites : 
12: for i = 1 to Number of Sites do 
13: x = Random Integer £ [0; Width] 
14: y = Random Integer € [0; Height] 
15: m = Random Integer € [0; 255] 
16: if m > 255-Color(x, y) then 
17: Add Site at (x,y) 
18: + 1 

19: end if 
20: end for 

21: Associate support points to closest sites 

22: for i = 1 to Number of Sites * a do 

23: Find the closest Site to the support point i 

24: d Smallest distance 

25: id Id of the nearest Site 

26: for j = 1 to Number of Sites do 

27: Calculate the distance Z between Site j and support 

point i 
28: if Z < d then 

29: d = I 

30: id = j 

31: end if 

32: end for 

33: Associate support point i with Site id 
34: end for 

35: while Convergence criteria < threshold do 
36: Move sites to mass center 
37: end while 



The convergence of Lloyd's algorithm to a centroidal Voronoi 
diagram on continuous domains has been proven for the one- 
dimensional case. Although the higher dimensional cases 
seem to converge similarly in practice, no formal proof is 
yet reported. Notice that here we fully discretize the space 
by considering support points. The criterion we used to 
define the convergence is the numeric comparison between 
the positions of each site between two consecutive iterations. 




Figure 2: Centroidal Voronoi Diagram (bottom) generated 
from successive Voronoi diagrams (top). Each site is iter- 
atively relocated to the centroid (the center of mass) of its 
Voronoi region. 



If the sum of those distances between the two positions of 
each site during the last iteration falls under a prescribed 
threshold, we consider that the computation is terminated. 
This discretized version of CVT provides good and quick 
results as attested in Figure [2] 



Another common solution is to stop Lloyd's relocation 
scheme after a prescribed number of iterations. La- 
gae [Lagac and Dutre 2008 suggested to use the normal- 
ized Poisson disk radius a € [0; 1] as a quality measure for 
point distributions. If two points in the distribution coin- 
cide, a = 0. If there is a hexagonal lattice, a — 1 and 
Lagae Lagac and Dutre 2008 chose a = 0.75 as optimal for 
a reference point set obtained via dart throwing (a common 
rejection sampling method). The convergence of a can be 
utilized as a termination criterion, where we stop Lloyd's 
method as soon as a becomes stable. 



3 CVT method for video stippling 
3.1 From images to video 

The first step consists in computing the first image stippling 
as explained in Section [5] We store the image and the posi- 
tion of each generator point and each site in a text file. The 
information we need for the remaining of the algorithm is 
the current density, the current number of generator points 
and the current number of sites. We need also to be able to 
identify each site. That is why we keep in the text file the 
identification number of all sites. This way of storing data 
gives us the possibility to pause our algorithm if needed, and 
resume it later. 



To compute a full video stippling, we first extract from the 
video an image sequence in order to be able to use image stip- 
pling methods. We then need to keep the information about 
stipple points on the N — 1 images before computing the N- 
th image. If we fail to doing so, no correlation between the 
images of the sequence will appear and undesirable buffering 
effects will appear in the synthesized stipple video. We start 
the computation of the iV-th image with the relocation of 
generator points and sites found for the (N — l)-th image 
by reading the corresponding text file. Then we seek for all 
the generator points and sites that are no longer needed in 
this new image. To decide whether to keep points or not, we 
generate a difference image where each pixel intensity is set 
as follows: 



Pdift(x,y) = if P N -i(x,y) - P N (x,y) < 
Pa\s{x,y) = P N ^ 1 (x,y) - P N (x,y), otherwise 



where Pjv(x,t/) is the color of the pixel at the coordinates 
(a;, y) of the image frame number N. 

Each generator point or site placed on a pixel where the color 
of the difference image is different from has a probability 
proportional to the value of the color to be deleted. If the 
value is 255 the probability is 100%. We do the same to 
find where we have to add new points by calculating another 
difference image as follows: 



Pdis{x,y) = if P N (x,y) - P N ~i(x,y) < 
P&m{x,y) = P N {x,y) - P N -i(x,y), otherwise 



We obtain two difference images as shown in Figure [3] 

Then we need to link the global density of the image with 
the total number of points. Thus we calculate for each image 
its global density which is equal to 



Read output file: 



Density d = 



EIU P(i 
255n 



where P(i) is the grey color of the pixel i and n is the total 
number of pixel. The user enter the initial number of points. 
This number of point is linked to the initial density and 
serves as a reference during the processing of the full video 
sequence. Our algorithm preserves the same ratio between 
the number of points and the image density during all the 
operations. 
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Draw points with these characteristics: 

N = FrameNumber 
Weight = N — [N\ 

x = X\jf\ * (1 — Weight) + asnyn * Weight 
V = V[N} * (1 - Weight) + j/pjvi * Weight 
Color = Color^j * (1 — Weight) + Color^i * Weight 
Color 

Size = * Contrast 

255 

Figure 4: The Java viewer first reads the source file, then 
renders on-the-fly the required frame by interpolating the 
frame and its characteristics from two consecutive discrete 
time frames. 



We repeat this step for all the images of the video sequence. 
When it is done, we need to produce a final file contain- 
ing all relevant information. Our program implementing the 
above stippling method reads all text files calculated during 
the process and store the evolution of each site identified 
by its own unique number. For each site we store at each 
frame its current position and color. With this information 
we generate a text file where we also add other important 
information such as the total number of sites that will ap- 
pear during the whole shot, the size of the video (eg., width 
and height dimensions), and the total number of frames. 

To read and "play" the output file, we have developed an in- 
house application in Java™ that renders the stippled video. 
With the information provided in the text file, the viewer 
application let us resize dynamically the video on-demand, 
change the contrast of the sites (the difference of size between 
a black site and a white one) and allows users to activate or 
not the color mode. Moreover, our application allows one 
to generate intermediate time frames by interpolating the 
position, size, and color of each site between two consecutive 
frames. Thus yielding true spatio-temporal vectorization of 
video media. 

The operation work flow carried out by the viewer to visu- 
alize stippled videos are explained in Figure [4] 

3.2 Image differences and drift adjustment of sites 
3.2.1 Handling fading effects 

The method described before is particularly well adapted to 
shots where objects appear and disappear instantly. The 
problem is that there are often objects that disappear using 
a fading effect. If we use the previous formula to calculate 
the image differences, we do not obtain a proper rendering 
for those fading transition shots. Indeed, if an object dis- 
appears progressively during three images (saym 33% each 
time), we suppress 33% of the generator points and site at 



Figure 3: From two consecutive frames (left), we generate two difference images (right). The evolution between the first two 
frames is shown on the central image. 



each time. But ((100 * 0.33) * 0.33) * 0.33 > 0. So there are 
generator points and sites in white zones of the image after 
the full removal of the object, which is of course not the 
expected result. To correct this step, we needed to find a 
difference formula that converges towards when the desti- 
nation image becomes fully white. We implemented in our 
program the following formula: 



If Pjv_i(i) < Pjv(i) we store in the difference image: 

255 



(p N (i) -Pv-i(i)) x 



255 - P N -i(i) 



If Pjv_i(i) > Pjv(i) we store in the other difference image: 

(PW*)-^«)x^ 
using the same notation as before. 



l(0 



This yields a perfect image/vector transcoding of the fading 
effect during a shot sequence. A work-out example is shown 
on figure [5] 

3.2.2 Drift correction 

Another major drawback observed is the drift of the sites 
which are on an edge of an object in the image. After 
putting the sites on the mass center of the associated gen- 
erator points, it appears that the sites are slightly located 
on the wrong side of the boundary, often on a white zone. 
To avoid this effect, we had to add another step in our al- 
gorithm. It is needed to do another step for removal and 
addition of sites and generator points to clean the image. 
Our algorithm is reported in Algorithm 2. 



Algorithm 2 Next image computation algorithm 

Difference images calculation 
2: New density calculation 

Total number of points and Sites update 
4: for i = 1 to 2 do 
i <— i + 1 

6: Support points and Sites suppression 

while Number of Sites < Total number of Site do 

8: Add Site 
end while 

10: while Number of support points < Total number of 
points do 

Add support point 
12: end while 

Move Sites to mass center 
14: end for 

while Convergence criteria < threshold do 
16: Move Sites to mass center 
end while 



To completely suppress this annoying artefact, we need to 
repeat the operation of suppression and the relocation of 



Figure 5: The text "Duffy" (left) fades out progressively to let the small character appear (right). The point density of 
Voronoi tesselations adapts automatically and progressively to this fading effect. (3000 points + 1500 frequency points) 



the sites several times until the number of deleted sites falls 
under a threshold. In practice, we noticed that repeating 
this step only one more time is already very efficient. The 
result is shown in Figure [6] 



mm 



Wm 



Figure 6: (Left) Some points drift from the original shape 
because of their identification with the centroid of their 
Voronoi region. (Right) our method removed this undesir- 
able effect. (2000 points) 



3.3 Color and contrast 

It would seem that to faithfully represent a grey-scale image 
of 256 levels, our method would require 256 support points 
per pixel of the original image. This is fortunately not the 
case. Indeed when rendering a stippling image we lose some 
information by the lack of support point. But we work on 
"areas" and not on "pixels" with stippling method. The 
most important information is the tone of each area of the 
image. The exact representation of the support of a stipple 
image is rather a segmented image without border and with 
gradient filled areas than a bunch of pixels. And in order 
to save the lost of "local" information we can easily improve 
the rendering by considering some other options. That is 
why we consider first the colour and the variation of the size 
of respective point sites. 

To each image, we can further add some properties to each 
site. It is easy to pick up the color of the pixel of the reference 
image which is at the same position of the site. With this 
information we can display the site in it original color and 
adapt its size in proportion to its grey color. A white site 
will be very small and a black site will have the biggest size. 
This simple operation improves considerably the perception 
of the rendering of the output stippled "video" and let the 
user distinguish more details in videos with low contrasts 
(see Figure [7J). 




Figure 7: (Top) Stippling result without contrast and color. 
(Middle) stippling with contrast. (Bottom) stippling with 
color to enhance the rendering. (3000 points) 



Figure 8: Discrete gradient Sobel operator on the image 
with a threshold that allows one to remove minor details. 



3.4 Frequency consideration 

In order to detect edges and enhance their support and ren- 
dering we considered a novel rendering approach. This oper- 
ation is carried out in parallel of the previous video process- 
ing. We first compute the discrete gradient at all pixels the 
images (using the Sobel operator) and filter the result with a 
threshold to put low frequencies at zero, as depicted in Fig- 
ure Then we apply the same algorithm with a number of 
site that can vary and that has to be adapted considering the 
number of pixels with high frequencies. For instance for a 
color image without much contrast like Lena, we have to add 
twice more points than the original number to have a nice 
render. But this has to be adapted by the user empirically. 

We obtain a distribution of sites located on the edges of the 
image. These edge points let us improve the overall render- 
ing of the shapes in the image. To get a good rendering, we 
merely add those sites to the sites previously calculated. To 
get a better rendering we reduce the size of these points by 
33% in comparison to the other points. The frequency-based 
stippling result is presented for Lena in the Figure [9] 

3.5 Pattern placement 

Detecting edges is easy but rendering them clearly by stip- 
pling is not really possible with a small amount of Sites. In 
order to keep a good render without having to generate a 
lot of Sites, we considered a way to replace the points used 
to represent the Sites previously obtained by the frequency 
approach, by patterns. The best solution is to use small seg- 
ments that follow the edge. Thus, with a small quantity of 
segments we are able to recreate the edges. 

After placing the patterns, we had to orient them perpen- 
dicularly to the local gradient of the image. To do so we 
just store for each image the local Sobel gradient following 
the x axis and the local Sobel gradient following the y axis. 
Then we estimate the local angle of the gradient with the 
following formula: 



Figure 10: Use of patterns to describe sharp edges in the 
image — 2000 segments. 



9(x, y) = arctan 




Once this operation is done we can associate each frequency 
Site with its orientation. We obtain on the Lena image the 
result shown on the figure [TUJ 

Contrary to the previous frequency approach, we need less 
point to obtain a pleasant render. Thus we increase the 
rapidity of our algorithm and save some place on the hard 
drive. To get a good rendering, we need to take in consider- 
ation the scale of these shapes into account in order to avoid 
the collision between them. This is possible by calculating 
an individual scaling factor for each of them. 

4 Experiments and results 
4.1 Blue noise characteristics 

Figure [TTJ shows a representative distribution of 1024 ran- 
dom points that was generated with Lloyd's method and ex- 
tracted form the paper of Balzer et al. [Balzer et al. 2009 . 
The distribution clearly exhibits regularities that are under- 
lined by the FFT analysis on the right of the figure. Our 
method generates a random set of points (here 3700 points) 
with less regularities. The results is substantially better and 
is conserved during a whole video. Two sets where generated 
and shown on figure [TT] The first has been made on a single 
image, with a direct point placement with our algorithm and 
present good characteristics. The second has been build in- 
side a video sequence by successive point addition frame by 
frame until the final number of 3700 points. We notice that 
this distribution has the same characteristics as the previous 
one. 

These blue noise characteristics are interesting because gen- 
erating a good random point set is a challenge. But we can't 
explain for now the reason of those results in our algorithm. 




Figure 9: We extract two density maps from the original source image: one is the color map and the other the frequency map. 
Then we apply our stippling algorithm and finally add both contrast and color information. We end the process by summing 
these two contributions — the classical (3000 points) and frequency approaches (6000 points). 



Spectral analysis 




Figure 11: Lloyd's method generates point distributions 
with regular structures if it is not stopped manually. The ex- 
ample set of 1024 points was computed with Lloyd's method 
to full convergence and contains regularities. The two other 
examples have been generated by our algorithm, the first di- 
rectly and the second after the computation of a whole video 
as the last frame of this video. They present less regularities. 
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Table 1: Timing experiments for a toy sequence of 10 
frames — in seconds. 



4.2 Algorithm's complexity 

Our method and software allows one to convert a video 
into a stippling video, namely a time-varying point set with 
dynamic color and size attributes. The complexity of our 
method only depends on the number of support/site points 
that we used to stipple the video. This number of points 
depends in turn on the following user input entries: 

• The number of sites; 

• The factor for calculating the number of support points 
— denoted by a previously. 

The complexity of the algorithm is quadratic, 0(n 2 ) where n 
is the number of sites. We have carried out several time mea- 
surements to confirm this overall complexity, and in order 
to identify some variations depending on other parameters. 
We have done the timing measures on a video containing 
10 frames. The video represents a black disk whose size 
decreases progressively itself 10% each frame. 

The timing statistics of that experiment are listed in Ta- 
ble [T] Each row stands for a frame (there are 10 frames), 
and each column denotes a number of point (from 1000 to 
10000). All timings are expressed in seconds. The graph on 
the Figure [12] plot the amount of time consumed to stipple 
a video depending on the number required sites. 



During the first set of measures, we used a video with a lot of 
point suppression. We noticed that if we compute the same 
video backward (so the disk is getting bigger each frame) 
for the same number of site, the time needed was different! 
In fact, we observed empirically that the addition of points 
requires more time than the operation of suppression. For 
3000 sites required on the frame with the biggest density, 
we needed 165 seconds to stipple the video and 602 seconds 
to compute the same video backward (the one with a lot of 
addition of points). Thus we emphasize on the fact that the 
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Figure 12: Graph representing the time required to calcu- 
late a 10-frame video with varying number of sites. 
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Figure 13: Evolution of the number of Site by frame in 
function of the density. 



stippling operation is not symmetrical although it visually 
leads to the same final result. 



To estimate the time required to compute the video we need 
to take into account the number of addition of points during 
the shot. This operation increases the overall computation 
time. Of course, one can accelerate our algorithm by porting 
it on graphics processor units (CPUs). The calculation of 
Voronoi diagrams is known to adapt well on GPU, and is 
far quicker and more efficient Vasconcclos ct al. 2008 than 
CPU-based algorithms. 

4.3 Size of the output file 

We measured for 3 different video shots of 9f frames the 
number of Sites depending on the frame density. The result 
is summed up on figure [T3l 



We noticed that algorithm maintains a perfect correlation 
between those to parameters following the initial ratio asked 
by the user. This observation implies that we can estimate 
the average number of Point needed to compute each frame. 
However, in order to estimate the total number of Sites 
needed — how many Sites are created during the compu- 
tation — we have first to know all the difference images and 
their density. Then we can estimate the total number of 
Sites with the formula: 



Where N represent the number of Sites and d the density. 

With this method we can quickly estimate the final size of 
the output video file. After a ZIP compression we observed 
that we need 200 bites per frame per f000 Sites. Thus for a 
stipple video with 25 frames per second, that lasts 1 hour, 
we encode a video in about 20 Mb. 

An possible way to improve the compression of the output 
file would be to store for each frame the characteristics of 
the points that are added and those of the suppressed point. 
For example for a video with 2000 Sites per frame and 3000 
different Sites used for the whole video we would need to 
store the characteristics of 3000 Sites, whereas our current 
method needs 2000 * iVFramc Sites. This improvement im- 
plies to consider that the Sites generated by our method are 
totally stable. In fact, they can move slightly during the 
shot to adapt with the variation of local intensity. But if 
we compute each with numerous iteration to reach a great 
stability, this movement is really small. So suppressing all 
the movement of point would result only on the complete re- 
moval of what we could see as the flickering effect. Indeed in 
our current video output the little jittering effect results of 
this movement and of a to big threshold taken for judging of 
the stability of the Site during the computation. This choice 
has been made to improve the rapidity of the computation 
of the video. 

4.4 Primary colours 

In order to be able to produce a render closer to some paint- 
ings we implements an algorithm that produces stippling 
with only primary coloured Sites — red, green and blue 
Sites. This extension is quite simple for a single image: we 
just convert each colour in red green or blue with a probabil- 
ity equal to its proportion of red, green or blue. An example 
is shown on Lena on figure [Ml But it raises some problems 
for video: 

• We have to maintain a high density of Sites. The eye 
makes a mean of the different colours an reconstruct the 
appropriate colour for each part of the image. That's 
why it is important to have a very high density of small 
points; 

• Having to maintain a high density of Sites increases the 
calculation time — proportional to n 2 where n is the 
number of Sites; 

• It increases the memory space needed to store the out- 
put video; 

• It increases the time needed to read each frame of the 
video and slower the frame rate in the viewer. 

Even if this approach generates drawings with only three 
colours, we looses some of the advantages of the Stippling 
process such as few colour transition, few storage space 
needed, few memory needed to read the video... To have 
a nice render we finally need to have a very high resolution 
of the stippling image and this put the stipple image with a 
number of Sites of the same order or about 10 times fewer as 
the number of pixel of the original. For example the image 




Figure 14: Stippling on Lena with primary colours. (15000 
points) 



on the figure [141 is drawn with 15000 Sites whereas the orig- 
inal image was composed of 256 * 256 = 65536 pixels, wich 
is only 4 times higher. 

5 Discussion and concluding remarks 

Stippling is an engaging art relating to non-photo-realistic 
depiction of images and videos by using point sets rendered 
with various color and size attributes. Besides the pure artis- 
tic and aesthetic interests of producing such renderings, the 
stippling process finds also many other advantages in its own: 

• The output video is fully vectorized, both on the spa- 
tial and temporal axes. Users can interactively rescale 
video to fit the device screen resolution and upscale the 
frame rate as wished for fluid animations. Stippling 
could thus be useful for web designers that have stor- 
age capacity constraints and yet would like to provide 
video contents that yields the same appearance on var- 
ious types of screen resolution devices (let it be PDA, 
laptop or TV). Furthermore, once the stippling pro- 
cess has been carried out, users can still personalized 
the rendering by tuning on-the-fly the size and other 
attributes of the points, without loosing much of the 
original semantic of the media. 

• Another characteristic of the stippling process is the 
production of a video that bear only a few pixel color 
transitions compared to the original medium. Indeed, 
on a usual video, we have Width x Height x -/VFr ame 
color transitions while in our output stippled video, 
we only have of number of transitions that is roughly 
AWnt x iVF rame . This is advantageous in terms of 
energy savings for e-book readers for instance. Stip- 
pling may potentially significantly increase battery life 
of such devices based on e-inks that consume energy 
only when flipping colors. 

• To improve the stippling process and correct the prob- 
lem of intensity loss of the stippled images (due to the 
averaging of the Human eyes), we can consider the area 
of each Voronoi cell and use this measure to extract a 
multiplicative normalization factor for the site of the 



corresponding Voronoi cell. With this normalization 
factor, we are able to nicely correct the loss of inten- 
sity of the stippled images; The smaller a Voronoi cell, 
the bigger the normalization factor, and the darker the 
color setting. 

Figure [T5] presents some extracted images from two stippled 
videos. The accompanying video (in mp4 format, play with 
Quicktime please) illustrates various steps of our algorithm, 
and describe results on various video. (An open source web 
applet allows one to play various stippled videos.) 
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Figure 15: These snapshots were extracted from two different output videos. See accompanying video. This second particular 
sequence does not take into account the frequency information of images. 



